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ABSTRACT 

Let  Xj, . . . , X j and  Yj,  . ,.,Yn  be  independent  random  samples  from  two 
continuous  distribution  functions  F and  G respectively  on  the  real  line.  We  wish 
to  test  the  null  hypothesis  that  these  two  parent  populations  are  identical.  Let 
X*  < ...  <X’  be  the  ordered  X-observations.  Denote  by  S^  the  number  of  Y- 
observations “falling  in  the  interval  [X'  , X' ),  k = 1, . . . , m . This  paper  studies 

the  asymptotic  distribution  theory  and  limiting  efficiencies  of  families  of  test 
statistics  for  the  null  hypothesis,  based  on  these  numbers  {S^}  . Let  h(* ) and 
(h^U),  k = 1,  ...,m}  be  real-valued  functions  satisfying  some  simple  regularity 
conditions.  Asymptotic  theory  under  the  null  hypothesis  as  well  asunder  a suitable 

sequence  of  alternatives,  is  studied  for  test  statistics  of  the  form  Z h(S.  ),  based 

m k=l  k 

symmetrically  on  S ' s and  those  of  the  form  £ h,  (S.  ) which  are  not  symmetric  in 
k k=l  K K 

{S,  } . It  is  shown  here  that  tests  of  the  symmetric  type  have  poor  asymptotic 
performance  in  the  sense  that  they  can  only  distinguish  alternatives  at  a "distance" 
of  n”4  from  the  hypothesis.  Among  this  class  of  symmetric  tests,  which  includes 
for  instance  the  well  known  Run  test  and  the  Dixon  test,  it  is  shown  that  the  Dixon 
test  has  the  maximum  asymptotic  relative  efficiency.  On  the  other  hand,  tests  of 
the  non  symmetric  type  can  distinguish  alternatives  converging  at  the  more  standard 
rate  of  n“2  . Wilcoxon- Mann -Whitney  test  is  an  example  which  belongs  to  this 
class.  After  investigating  the  asymptotic  theory  under  such  alternatives,  methods  are 
suggested  which  allow  one  to  ^elect  an  '’optimal"  test  against  any  specific  alternative, 
from  among  tests  of  the  type  Z h,  (S^)  . Connections  with  rank  tests  are  briefly  ex- 
plored and  some  illustrative  «famples  provided. 
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and  G(y)  respectively.  We  wish  to  test  if  these  two  populations  are 

identical  i.  e. , the  hypothesis  that  the  two  d.  f.  s.  are  the  same.  A simple 

probability  integral  transformation  carrying  z — I(z)  would  permit  us  to 

assume  that  the  support  of  both  the  probability  distributions  is  the  unit 

interval  [0, 1]  and  that  the  first  of  them  is  the  uniform  d.  f.  on  [0, 1]  . 

For  the  purposes  of  our  discussion,  this  probability  transformation  can  be 

done  without  loss  of  any  generality  as  will  be  apparent  soon.  Thus  from 

now  on,  we  will  assume  that  this  reduction  has  been  effected  and  that  the 

* -1 

first  sample  is  from  the  uniform  distribution  U(0, 1)  . Let  G = G • F 
denote  the  d.  f.  of  the  second  sample  after  the  probability  transformation. 

The  null  hypothesis  we  wish  to  test,  specifies 

(1.1)  Hq:  G*(y)  = y,  0 < y < 1 . 

Let  0 < X'  < . . . < X*  <1  be  the  order  statistics  from  the  first 
— 1 — — m-1  — 

sample.  The  sample  spacings  (Dj, . . . , D^)  for  the  X-values  are  defined  by 
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(1.2) 


Dk  * \ - XLl- 


k = 1, . . . , m 


l 
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where  we  put  X^  = 0 and  = 1 . Tests  based  on  these  sample  spacings 
have  been  considered  In  the  literature  for  the  goodness-of-flt  problems. 

See  for  instance  Darling  (1953),  Pyke  (1965)  and  Rao  and  Sethuraman  (1975). 
Define  for  k = 1,  . . . , m 

(1.  3)  = number  of  yy  s in  the  interval  [X^  X'^). 

Our  aim  is  to  study  various  test  statistics  based  on  these  numbers 

{S| Sm)  fortesting  HQ  . Since  the  numbers  {S^} 

as  well  as  the  statistics  based  on  them  remain  Invariant  under  proba- 
bility transformations,  there  is  no  loss  of  generality  in  making  such  a 
transformation  on  the  data,  as  was  done  earlier.  It  may  be  remarked  here 
that  we  take  (m-1)  instead  of  the  usual  m observations  in  the  first  sample 
since  this  yields  m numbers  {S^,  - • - , s } instead  of  (m+1),  leading  to 
slightly  simpler  notation. 

Our  aim  is  to  study  the  asymptotic  theory  as  m and  n tend  to  in- 
finity. We  will  do  this  through  a nondecreasing  sequence  of  positive  in- 
tegers {m  } and  {n  } . We  will  assume  throughout,  that  as  v — <®  , 
v v 

(1.4)  m -*  » , n — oo  and  rn  /n  = r — p.  0<p<«=. 

v v v v v 

Note  that  {D,  } defined  in  (1.2)  depend  on  m the  number  of  X-values 
k v 

and  it  is  more  appropriate  to  label  them  as  {D,  } . Similarly  the  numbers 

kv 

(S.  } defined  in  (1.  3)  depend  on  both  m and  n and  should  therefore 
k v v 
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be  denoted  by  {S,  } . Thus  we  are  dealing  with  triangular  arrays  of 

kv 

random  variables  {D.  , k = 1. . . . , m } and  {S,  , k = 1 m } for 

kv  v kv  v 

v > 1 . Corresponding  to  the  (v  > 1)  array,  let  h (•)  and 

— — v 

{h^^(-),  k = 1, . . . , m } be  real-valued  functions  satisfying 

certain  regularity  conditions  (see  Condition  (A)  of  Section  2).  We  now 

define 


(1.  5) 

and 


(1.6) 


jjt 

based  on  the  (m  -1)  X-values  and  the  n Y-values.  Though  T is  a 
v v v 

special  case  of  T when  {h.  (•)}  do  not  depend  on  k,  we  will  dis- 
v kv 

tinguish  these  two  cases  since  their  asymptotic  behavior  is  quite  different 

in  the  non-null  situation.  We  may  point  out  here  that  the  Wald-Wolfowitz 

. * 

(1940)  Run  test  and  the  Dixon  (1940)  test  are  of  the  form  T while  the 

v 

Wilcoxon- Mann- Whitney  (1947)  test  is  of  the  form  T^  . In  fact,  any  linear 

function  based  on  the  X-ranks  in  the  combined  sample,  can  be  expressed  as 

a special  case  of  T . We  will  discuss  more  of  this  in  Section  7. 
v 

A few  words  about  the  notations:  Though  the  quantities  m,  n,  r, 
as  well  as  the  functions  h(* ),  {h^( * ) } depend  on  v,  for  notational  con- 

venience we  shall  suppress  the  suffix  v,  except  where  it  is  essential. 

m m 

Thus  for  instance,  T = V h (S  ),  T = ^ h(S  ) and  r will  stand  for 

V k=l  k k V k=l  k 

(m/n)  etc.  We  will  also  indicate  the  probability  law  of  a random  variable 


(or  random  vector)  X by  f(X)  . 


N(p,  2)  will  represent  a normal  distribution  with  mean  and  covariance 
matrix  2 throughout  while  N(0,  0)  stands  for  the  degenerate  distri- 
bution at  the  point  zero.  For  0 < x < oo,  Po(x)  will  represent  the  Poisson 
distribution  with  mean  x and 

(1-  7)  "j(x)  = e X • xJ/J  !,  j = 0, 1,  2,  . . . 

the  Poisson  probability  of  J . For  £ = (p^, . . . , Pm)»  mult(n,  jg)  will  de- 
note the  m-dimensional  multinomial  distribution  with  n trials  and  cell 
probabilities  ( pjy  . . . , p^) . Let  r(a,  b)  denote  the  Gamma  distribution 
with  density 

(1.  8)  e X//b.  xa  . p(aj  for  o < x < oo  . 

A negative  exponential  i.  e. , a r(l,  1)  random  variable  will  be  denoted 
throughout  by  W.  ti  will  stand  for  a geometric  random  variable  with  pdf 

(1.  9)  P(r)  = J)  = P/(1+P)j  +1,  j = 0, 1,  2, . . . 

for  0 < p < oo  . 

Also  for  any  random  variable  X , we  write  X = o (g(n))  if 

n’  n p " 

X^/g(n)  — 0 in  probability  and  we  write  Xn  = Op(g(n))  if  for  each  « > 0 , 

there  is  a K < oo  such  that  P{|x  /g(n)  | > K } < t for  all  n sufficiently 
« n c 

large.  Finally  fx]  will  denote  the  largest  integer  contained  in  x . 

We  shall  consider  a sequence  of  alternatives  specified  by  the  d.  f.  s. 

(1. 10)  G*  (y)  = y + (Lm(y))/m6,  0 < y < 1 


-4- 


where  L (0)  = L (1)  = 0 and  6 > | . In  terms  of  the  original  d.  f.  s 
mm  — 

F and  G,  the  null  hypothesis  specifies  G = F,  while  under  the  alterna- 
tives we  have  a sequence  of  d.  f.  s G that  converge  to  F as  the  sample 

m 

size  increases.  Indeed 

(1  11)  L (y)  = m6(G  (F-1(y))  - y)  . 

m m 

We  assume  that  there  is  a function  L(y)  on  (0,1)  to  which  Lm(y)  con- 
verges. For  the  other  assumptions  on  L (• ) and  L(*)  refer  to  Theorems 

m 

3.1  and  4.1.  This  sequence  of  alternatives  (1.10)  is  smooth  in  a certain 
sense  and  has  been  considered  before.  See  for  instance  Rao  and 
Sethuraman  (1975)  or  Holst  (1972). 

The  organization  of  this  paper  is  as  follows;  In  Section  2,  we 
establish  some  preliminary  results.  Theorem  2.1  gives  asymptotic  distri- 
bution of  functions  of  multinomial  frequencies  while  Theorem  2.  2 establishes 
a result  on  the  limit  distributions  of  non-symmetric  spacings  statistics, 
which  is  of  independent  interest.  These  results  are  combined  in  Theorem 

3. 1 to  obtain  the  limit  distribution  of  T under  the  alternatives  (1. 10) 

v 

with  6 = j . It  is  clear  that  putting  Lm(y)  = 0 in  this  theorem,  gives  the 

asymptotic  distribution  of  T^  under  HQ  . Section  4 deals  with  the  symmetric 

$ $ 

statistics  T . Theorem  4. 1 gives  the  asymptotic  distribution  of  T under 
v v 

the  sequence  of  alternatives  (1. 10)  with  6 = \ . Again  putting  L (y)  = 0 

m 

* 

gives  the  limit  distribution  of  T under  H . It  is  interesting  to  note  that 

v 0 

$ 

the  symmetric  classes  of  tests  T^  can  only  distinguish  alternatives 

< 
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converging  to  the  hypothesis  at  the  slow  rate  of  n 4 unlike  the  non- 
symmetric  statistics  which  can  discriminate  alternatives  at  the  more  usual 

_ i 

distance  of  n 2 . Similar  results  hold  for  tests  based  on  sample  spacings 

depending  on  whether  or  not  one  considers  symmetric  statistics.  See  for 

instance  Rao  and  Sethuraman  (1969,  197  5).  Some  further  asymptotic  theory  of 
spacings  statistics  is  currently  under  investigation  by  the  present  authors. 
Asymptotic  efficiencies  and  applications  are  discussed  separately  for  the 
non- symmetric  and  the  symmetric  cases,  in  Sections  5 and  6 respectively. 
Section  7 contains  some  further  remarks  and  discussion. 

2,  Some  preliminary  results? 

The  following  regularity  conditions  which  limit  the  growth  of  the  func- 
tions as  well  as  supply  certain  smoothness  properties,  will  be  needed  in 
connection  with  the  results  of  this  and  the  next  section. 

Condition  (A);  We  will  say  that  the  real-valued  functions  {h^(- ) } defined 
on  (0,1,2,...  } satisfy  Condition  (A)  if  they  are  of  the  form 

(2.1)  hk(J)  = h(k/(m+l),  j),  k = l m J = 0, 1,  2, . . . 

for  some  function  h(u,  j)  defined  for  0 < u < 1,  ) = 0,  1,  2, . . . with  the 
properties 

(1)  h(u,  J)  is  continuous  in  u except  for  finitely  many  u and  the 
discontinuity  set  if  any,  does  not  depend  on  J . 

(ii)  h(u,  J)  ^c  • J + h(u)  for  some  real  number  c . 

(iii)  For  some  6 > 0,  there  exist  constants  c}  and  c^  such  that 

-6- 


_ lit 

(2.2)  |h(u,  J)|  < c.  • fu(l-u)]  2 • (]  +1)  for  all  0<u<land  j = 0, 1,  2,  . . . 

Condition  (A1);  We  will  say  that  the  real-valued  functions  (g^(*)}  de- 
fined on  [0, »)  satisfy  Condition  (A')  if  they  are  of  the  form 

g^x)  = g(k/(m+l),  x),  k = 1,  . . . , m and  0 < x < oo 

for  some  function  g(u,  x)  defined  for  0 < u < 1 and  0 < x < oo  with  the 
properties 

(i)  g(u,  x)  is  continuous  in  u except  for  finitely  many  u and  the 
discontinuity  set  if  any,  does  not  depend  on  x . 

(ii)  g(u,  x)  5*  c . x + g(u)  for  some  real  number  c . 
and  (iii)  for  some  6 > 0,  there  exist  constants  c^  and  c^  such  that 

lit  ^2 

(2.3)  |g(u,  x)|  < c • [u(l-u)]  2 • (x  +1)  for  all  0<u<l  and  0<x<»  . 

We  require  the  following  simple  lemma. 

Lemma  2.1.  Let  h(u)  defined  for  0 < u < 1,  be  continuous  except  for 
finitely  many  u and  have  the  property;  for  some  a > -1,  there  exists  a 
constant  c such  that 

(2.4)  lh(u)|  < c • [u(l-u)]a  . 

Then 

m 1 

(l/m)£  h(k/(m+l))  - f h(u)du  as  m -*  * . 

k=l  0 

Proof;  Define  the  step  function 


MMNMI 
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hm(u>  = h(([mu]-H)/(m+l)),  0<u<l  . 


Then  clearly  h^u)  — h(u)  as  m -*  <»  except  for  finitely  many  points. 
Observe 

|hm(vi)|  < 2c[u(l-u)]a 
1 

for  m sufficiently  large  and  J fu(l-u)]a  < » for  a > -1  . Thus  by  the 

0 

Lebesgue  Dominated  Convergence  theorem, 

m 1 1 

(1/m)  y h(k/(m+l))  = f h (u)du  — f h(u)du 
k=l  0 m 0 

which  proves  the  assertion.  □ 

Turning  to  the  main  problem,  we  will  obtain  the  distribution  of  T 

v 

defined  in  (1.  5)  essentially  in  two  steps.  First  we  consider  the  statistic 

T for  given  values  of  the  X-spacings  D ={  D.  .....  D } . Since  the 

numbers  {S,,...,S  } given  D have  a multinomial  distribution,  we  need 

a result  on  the  multinomial  sums.  We  formulate  this  part  of  the  result  in 

Theorem  2.  1.  The  expressions  for  the  asymptotic  mean  and  variance  of 

this  conditional  distribution  of  T^  given  D,  are  functions  of  D.  In 

Theorem  2.  2,  we  formulate  a general  result  on  the  limit  distributions  of 

functions  of  spacings,  which  allows  us  to  handle  in  particular,  these 

expressions  for  the  asymptotic  mean  and  variance.  Theorem  3. 1 of  the 

next  section  combines  these  results  along  with  other  lemmas  given  there, 

thus  giving  the  required  asymptotic  distribution  of  T . 

v 


It  is  clear  that  the  conditional  distribution  of  the  vector  of  occupancy 

numbers  S=(S,,...,S  ) given  the  spacing  s vector  D = (D  , . . . , D ) is 
= 1 m = i m 

multfn.  D D ).  Therefore  the  test  statistic  T,  conditional  on  D, 

* 1*  * m v = 

has  under  the  null  hypothesis  the  same  distribution  as  the  random  variable 

m 

V 

(2.5)  Z = £ W 

v k=l 

where  (v ) is  Mult(n,  p,,  . . . , p ) with  cell  probabilities 

V m 1 m 

(p.,  . . . , p ) being  equal  to  the  spacings  (D,, . . . , D ) . Since  the 
1 m i m 

asymptotic  mean  and  variance  of  can  be  more  simply  stated  in  terms 

of  Poisson  random  variables,  we  introduce  a triangular  array  of  independent 

Poisson  random  variables  • • • > £m  v > 1 where  is  pP(nvPkv,) 

v 

and  set 

m 

(2.6)  V ' Z , hk(5kv) 

k=l 

u.  = E(k  ),  <r  = Var(X.  ) 
v ’ v v 

(2.7) 

I 

The  following  theorem  on  the  asymptotic  distribution  of  the  multinomial 
sum  Z is  due  to  Holst  (1976a). 

V 

Theorem  2.  1.  Let  (<Pj,  . . . , </>m>  be  mult(n,  pJf . . . , Pm)  and  Z^, 
and  <r  be  as  defined  in  (2.  5),  (2.  6)  and  (2.  7).  For  0 < q < 1,  set 


M - f mn 1 *nd 


\q  * l W 

k=l 


Assume  that  there  exists  a q^  < 1 such  that 


(2.9) 


V p.  -*  P , 0 < P <1  for  q„  < q < 1 , 

^ k q’  q 0 - ’ 


(2.10)  z 


(\  -EX.  )/m2 
vq  vq" 


pV”V/n 


A B 

q q 

B P 

q q 


where  A , B and  P are  such  that  as  q -»  1-  , 

q’  q q ’ 


(2.11) 


A -*•  A,,  B -*  B,  and  P -*  1 . 

q i q l q 


Then  as  v — x , 


(2. 12) 


£((Zv-Pv)/av) 


N(0,  Aj-Bj) 


From  (2.6)  and  (2.7)  an  explicit  expression  for  the  asymptotic  mean 


is  given  by 


m 00 


(2.13) 


>*.  * I,  l \<1>  VnV 


k=l  J=0 


using  the  notation  (1.7).  Under  the  null  hypothesis,  we  have  p^  = , 


k = 1,  . . . , m where  D are  the  spacings  from  U(0, 1)  . Thus  we  consider 


(2.14) 


m oo  m 

P(n9)  = Yj  Z hk(J)  = Z gk(mDk) 

k=l  J = 0 1 k=l 

where 

00 

(2.15)  g (x)  = V h (J)  m (x/r)  . 

K j=0  K J 

Random  variables  of  the  type  (2.  14)  have  been  considered  by  Darling  (195  3), 

Lecam  (1958)  and  Pyke  (1965).  For  the  symmetric  case  i.  e. , when  g^(x)  = 

g(x)  for  all  k , Darling  (1953)  obtained  some  limit  theorems  for  certain 

special  cases  and  Le  Cam  (19  58)  gave  a complete  characterization  of  the 

limit  laws.  See  also  Rao  and  Sethuraman  (197  5)  for  some  results  in  the 

symmetric  case  and  their  asymptotic  efficiencies.  Pyke  (1965)  pointed  out 

(cf.  Section  6.  2)  that  Le  Cam's  method  could  be  used  to  study  the  non- 

symmetric  case.  Since  no  complete  result  or  its  proof  is  explicitly  given 

there  in  a form  useful  for  our  purposes,  we  state  and  prove  such  a limit 

theorem  (Theorem  2.2)  for  non-symmetric  functions  of  uniform  spacings. 

An  especially  useful  form  of  Theorem  2.2  is  given  in  Corollary  2.1. 

The  method  of  proof  we  adopt  is  a mixture  of  the  methods  used  by 

Le  Cam  (1958)  and  Darling  (1953)  and  this  result  on  spacings  is  of  some 

independent  interest.  Let  W.,  W_,  ...  be  independently  and  identically 

1 
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distributed  (i.  i.d.)  r(l,  »)  random  variables  i.  e. , with  a negative  exponential 
- w 

distribution  (1-e  ),  w>0.  Let  {gk(- ),  k = 1,  be  real - 

valued  measurable  functions.  For  0 < q < 1,  let  M = [m  - ql  . Define 

— v v J 


(2. 16) 


M 

go  = y 

qv  k=i 


gk<Wk> 


Theorem  2.  2;  Assume  that  the  variance  of  G exists  and  is  oositive 

qv 

for  all  q and  v i.  e.  , 

(2.  17)  0 < Var(G  ) = o-2  (G  ) < » for  all  q and  v . 

qv  qv 

Assume  further  that  for  each  q e (0,  1] 


(2.  18) 


with  A 

< 

(2.19) 

(2.  20) 


and 


(G  - EG  )/<r(G,  ) \ 
qv  qv  ' lv'  1 

/ 

( A B M 

- N 

q q 

M , 

T,  (W  - l)/m  2 , 

1 1 
3 such  that 

q 

iBq  q / 

/ 

A -*  A = 1 as 

q 1 

q - 

1- 

B -»  B.  as 

q 1 

q — 

1-  . 

Then,  as  v -*  °o  , 


m 

(2.21)  z ((  Y,  9k(mDk)  - EGlv)/<r(Glv))-*  N(0,  1-B2  ) . 

where  Dj,  . . . , Dm  are  spacings  from  U(0, 1)  . 

Proof : Choose  0 < q < 1 . For  z > 0 consider  i.  i.d.  random  variables 


Vp  ' ' ‘ » Vm  Wit^  distribution  i.  e. , exponential  with  mean  1/z  . 


The  sum  ( Vt  •*-  - . . + V^)  has  r(m,  1/z)  distribution,  whose  density  is 


denoted  by  f( - > below.  Using  ideas  of  conditional  expectation,  the 

M 

characteristic  function  of  £ g (mV  ) at  u can  be  written  as 

k=l  k k 


M 

(2.22)  GM(z,  u)  = E(exp(iu  7 gk(mVk))) 


ae  M M 

= J E{exp(iu  £ gk(mVk))|£  vk  = t)}  f(t)dt 


= f {<♦> (t,  u)}  zmtm_1  e Zt/(m-l)!dt,  say. 
0 


m 


Since  the  conditional  distribution  of  (V,,...,V  ) given  V V.  = t is  the 

1 ’ m "T'  k 

same  as  that  of  (tDj, . . . , tD^)  which  does  not  depend  on  z,  it  follows 
that  neither  does  4>(t,  u)  . Observe  that 

M m 

4>(u)  = «>(1,  u)  = E(exp  (iu  £ gk<mVk))|  ^ \ = 1 ) 


(2.23) 


M 

= E(exp(iu  £ gk(mDk))) 


On  the  other  hand  the  function  defined  by 

M 


* oo  oo  M M M 

;M(z,u ) = / •••  / exp(iu  £ gk(mvk))z  exp(-z  £ vfc)  dVj.-.dVj^ 


is  analytic  in  z for  all  complex  z such  that  Re(z)  > 0 . This 
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function  coincides  with  G..(z  u)  on  the  positive  real  axis  and  hence 

M 

for  all  z in  the  right  half  plane.  Consider 


M 


(2.24) 


M 00  00  M M 

(m+iy,  u)  = (m+iy)  f ...  f exp(iu  Y g (mv  ) - iy  Y v ) 

0 0 I 1 


M 

exp(-^  mv^)dvj  ...  dv 


M 


, M 


= (l  + iy/m)  exp(-iyM/m) 


M 


M 


E(exp(iu  £ 9k(Wk  - iy  £ (Wk-l)/m))) 


where  Wj, ...  ,W^  are  1.1.  d.  r(l,  1)  random  variables.  It  is  easily 
checked  that  the  conditions  for  using  the  complex  Laplace  inversion 
formula  (see  for  instance  Courant  and  Hilbert  (1962)  p.  536)  on  (2.22)  are 
satisfied.  On  doing  this  inversion,  we  obtain  for  c > 0 


$(t,  u)  • tm*  = (m-1)  ! (l/2iri)  f eZ  z m G^(z,  u)dz  . 

c-ioo 

Putting  c = m and  t = 1 in  this  formula  and  using  (2.  2 3)  and  (2.  24)  we 
get 

M 

4>(u)  = E(exp(iu  Y gk(mDk))) 


00 

(2.25)  = m!/(2irm)  J em+iy(m+iy)  m (l+iy/m)^  exp(-iyM/m) 

-oo 

M 

E (exp(iuGq  - iy  ^ (Wfc  -l)/m))dy  . 
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Putting  x = y/m  2 and  using  Stirling's  formula  for  m!  we  obtain 


<J>(u)  = e°  (2tt)  2 J exp(ix(m-M)/m2)  • (1+ix/m2) 


I.  M-m 


E(exp(iuGq  - ix  ^ (Wk-l)/m2)dx  . 


From  this,  it  follows  that 


4Mu)  = E(exp(iu(  £ g^m^)  - EG  )/«r(G  ))) 
k=l 


(2.26)  = e°^(2ir)~  ? J exp(ix(m  - M)/m2)(l  + ix/m^) M_I 


E(exp(iu(Gqv  - EGqv)/o-(Glv)  - ix  £ (Wk  - l)/m?))dx  . 
The  integrand  in  (2.  26)  is  dominated  by 


, (M  -m  )/2  2(1  % 

hv(x)  = (1  + x2/mv)  v v h(x)  = e'X  (1'q)/2  as  v -» 


and  it  is  easily  verified  that 


J h (x)dx  -*  J h(x)dx  as  v-*«>  . 


Thus  using  the  extended  Lebesgue  dominated  convergence  theorem  (see  for 
instance  Rao  (197  3),  p.  136),  it  follows  from  assumption  (2.18)  and  the 
formula  (2.  26)  that  as  v -*  oo  f 

i 00  , 

4>v(u)  - (2w)"?  f exp(-(l-q)x  /2) 


exp(-(Aqu2  +2Bqux  + qx2)/2)dx  = exp(-(Aq-Bq)u2/2) 
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The  continuity  theorem  for  characteristic  functions  gives 


s4 


Urn 


M 


(2.28)  £((  V g (mD.)-EGn  )/<r(G,  )))-*  £(X  ) = N(0,  A -B  ) as  v - « 

•y  k k qv  iv  q q q 

Using  an  analogous  derivation,  we  can  show 


m 


(2.29)  r(  Y,  (9k(mDk)  * Eg^Wk))/<r(Glv))-  £(X^)  = N(0, 1 -Aq -(Bj- Bq)  ) 
M+l 


as  v — 00 


Combining  (2.28)  and  (2.29)  with  (2.19)  and  (2.20),  we  get 


(2.  30) 

t(X  ) - N(0,1-B*)  as  q - 1- 

and 

(2.  31) 

£(X^)  - N(0,  0)  as  q -*  1-  . 

Hence  using  the  argument  on  pp.  13-14  of  LeCam  (1958)  the  assertion  of 
the  theorem  follows.  □ 

The  following  corollary  gives  a simple  sufficient  condition  on  the 
functions  gk(*)  in  order  that  Theorem  (2.21  holds. 

Corollary  2. 1 . The  asymptotic  normality  asserted  in  Theorem  2.2  holds 
for  any  set  of  functions  {g^( * ) } which  satisfy  condition  (A‘)  . 

Proof:  To  prove  this  corollary,  we  need  to  check  that  the  assumptions 

(2.17)  to  (2.20)  hold  when  condition  (A')  is  satisfied.  It  can  be  easily 
checked  that  if  g(u,  x)  satisfies  condition  (A1),  then 

oe  00  2 * x 

f g(u,x)e"Xdx,  J g (u,  x)e  Xdx  as  well  as  J g(u,x)(x-l)e  dx 
0 0 0 
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■ ■***. 


satisfy  conditions  of  Lemma  2. 1 in  u . Thus  from  the  definition  of  G 

qv 

and  Lemma  2.1,  as  m -►  oo  , 


[mq]  q 

(2.  32)  E(G  )/m  = (1/m)  £ Eg(k/(m+l),  W )- / Eg(u,  W)du 


[mq]  q 

(2.33)  Var(G  )/m  = (1/m)  Y Var(g(k/(m+l),W  ))-  f (Var  g(u,  W))du 

Qv  k=l  k 0 

and 

M [mq] 

(2.34)  Cov(Gqv,  Y Wk)/m  = (1/m)  £ Cov(g(k/(m+l),  ) 

q 

— J Cov(g(u,  W),  W)du 
0 


where  W is  a r(l,  1)  random  variable.  Again  from  (2.  3)  of  condition  (A'), 
all  these  limits  are  finite.  These  are  also  continuous  in  7 so  that  (2. 19) 
and  (2.  20)  are  satisfied. 

Finally  to  check  the  asymptotic  normality  in  (2. 18)  or  equivalently 
of 

[mq]  [mq] 

(2.35)  £ (a(gk(W^  - Egk(Wk))  + (Wk-1)}  = £ g^),  say 

k— 1 k — 1 


for  all  real  a,  we  have  only  to  verify  the  Lindeberg  condition  for  the 
non-identical  case.  It  is  easily  seen  that  if  (g^-)}  satisfy  condition 
(A*),  so  do  [g*(-)}  defined  in  (2.  35).  Let 

2 *2  7 ("**1  2 

(2.W,  % ■ E,  (Wfc)  and  »[mq)  . ^ % . 
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Since  {g  (•)}  satisfy  condition  (A'),  we  have  as  in  (2.  33)  that 


(2.  37) 


’[mq] 


[mq]  oo  +2 

,/m  = (1/m)  Y f g (w)e'Wdw 
k=l  0 


converges  to  a finite  non-zero  constant  from  Lemma  2. 1.  Now  consider 
2 


[mq]  .2 


<l/,[mq]' 'S  ' /.  . < 

1 1 k=l  x >«  s,  , 

[mq] 


- (m/s[2mq])  ' (1/m)  Z'  / 


[mq] 

k=l  | x | >€  s 


Cj  [(k/(m+l))(l-k/(m+l)] 


-1  +26 


[mq] 


[mq] 


2 2 -x 

(x  +1)  e dx 


< {mCj/s^^H(l/m)  y,  [(k/(m+l))(l  - k/(m+l))]'1+26 } 


2 2 -x 

(x  +1)  e dx} 


X >€  S 


[mq] 

as  m -*  oe^  the  quantities  in  the  first  two  parentheses  remain  bounded 
because  of  (2.  37)  and  I.emma  2. 1 while  the  integral  in  the  third  parenthesis 
qoes  to  zero  for  any  * > 0 since  sj-mqj  *s  °f  order  (\]  m)  from  (2.  37) . 
Thus  the  Lindeberg  condition  is  satisfied  for  (2.  35)  and  thus  the  joint 
asymptotic  normality  required  in  (2.13)  holds  for  functions  satisfying 
condition  (A1).  □ 

3.  Asymptotic  distribution  theory  for  nonsymmetrlc  statistics 

As  mentioned  in  Section  1,  t|  will  denote  a geometric  random  vari- 
able defined  in  (1.  9)  while  W will  represent  a r(l,  1)  random  variable. 
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Where  confusion  is  likely  to  arise,  we  will  denote  the  expectations  with 


respect  to  ri  and  W by  E and  E,,r  respectively.  Then  it  is  easy  to 

H w 

verify  that  for  j = 0, 1,  2, . . . 

(3.1)  P(t!=J)  = p/(l+p)J  + 1=  Ew(wJ(W/p))  . 

If  h(u,  j)  is  a function  satisfying  condition  (A),  we  define  for  later  use, 
the  following  additional  functions. 


(3.2) 

(3.  3) 
and 

(3.4) 


g,(u,  x)  = Y,  h(u,  j)w  (x)  , 
J=0  1 


00 


g (u,  x)=  Yj  h (u,  J)  tr  (x) 
J = 0 


00 


g (u,  x)=  Y h<u»  JHJ-x)n  (x)  . 


1=0 


When  h(u,  J)  satisfies  condition  (A),  these  are  all  well-defined  and 
finite  for  all  p > 0 . Let 


* 

H(u)  = E h(u,n)*y  h(u,J)  P/(1+P)J  + 


(3.5)  - **  “vu» 

11  1=0 


90 


= V h(u,j)Ew(irj(W/p))  = Ewgi(u,w/P) 


for  0 < u < 1 and 
1 


1 


(3.6)  o-  = f Var(h(u,  ri))du  -(/  Cov(h(u,  i-]),  T^du)  /Var(n) 
0 0 

* 1 2 

= Y.  / - H(u»  du  p/<1+p)J 

1=0  0 


J+l 


(Y  /1(h(u,j)  - H(u))du  lp/(l+p)J+1)2  . p2/(l+P)  . 


)=0  0 
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Also  from  the  Cauchy-Schwarz  inequality 

rl  2 r1  i I 2 

if  Cov(h(u,  n),  n)du)  <(J  (Var  h(u,  r|)) 2 (Var  t})2  du) 

' 0 0 

1 1 
< if  Varh(u,  r))du )(f  Var(r|)du) 

0 0 

1 

= Var(r|)  • ( f Var  h(u,  t})du) 

0 

with  equality  if  and  only  if  h(u,  j)  = c • j + h(u)  for  some  real  number  c 

2 

and  some  function  h(u)  . Thus  <r  > 0 for  any  function  h(u,  j)  satisfying 

condition  (A)  . For  x = (x, x ),  we  also  define 

= 1’  ’ m ’ 

m 

v m oo 

(3.7)  h (x)  = Y g (k/(m+l),  x ) = £ V h (j)  it  (x  ) 
k=l  k=l  J=0  1 

and  write  W = (W^,  . . . where  the  components  are  i.  i.d.  r(l,  1) 

random  variables. 

Before  we  proceed  to  state  the  theorem  which  gives  the  asymptotic 
distribution  of  under  the  alternatives,  a few  words  about  the  sequence 
of  alternatives.  Consider  the  Y-observations  from  the  distribution  function 
given  in  (1.  10),  (1.  11)  with  6 = } i.  e.  , 

(3.  S)  A^s  GJy)  = Gm(F'1(y)) 

mm  m 

l 

= y + L (y)/m2,  0 < y < 1 . 

m — — 
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Assumption  (B);  We  will  assume  that  there  exists  a continuous  function 
L(y)  such  that  for  0 < y < 1, 

i -1 

Lm(y)  = m2[Gm(F  (y))  - y]  -*  L(y)  as  m - so  . 


We  also  suppose  that  the  derivatives  L^(y)  and  L'  (y)  = i (y)  exist  and  are 
continuous  outside  some  fixed  finite  set  D C [0, 1]  and  that  finite  left  and 
right  limits  of  the  derivatives  exist  on  the  open  interval  (0, 1)  . 

Given  the  X-sample,  the  probability  of  a Y-observation  falling  inside 
P^k  under  the  null  hypothesis  is  given  by  the  uniform  spacings 

. On  the  other  hand,  under  the  alternatives  (3.8),  this  probability 
is  given  by 


(3.9) 


* 


where  Uj^  k = 1, . . . , m 


U-  = 0,  U1  = 1 and 
u m 

(3.10)  a 

k 


= Gm<r'K»  - VH.,)) 

= Dk(l  + Ak/m~2) 

are  order  statistics  from  U(0, 1)  with 


■ [W  - 


Note  that  D.  > 0 with  probability  one  so  that  A,  is  a well-defined 
k k 

random  variable.  We  now  state  the  main  theorem  of  this  section,  whose 
proof  will  be  completed  in  Lemmas  3. 1 to  3.  7.  The  conditions  of  this 
theorem  are  not  in  the  most  general  form  but  are  adequate  to  cover  all  the 
cases  of  interest  discribed  in  Section  5. 
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Theorem  3. 1 . Let 


m 


(3.11)  V = £ (h(k/(m+l),  S ) - H(k/(m+l)))/m*  • a 


k=l 


with  H(u)  and  a defined  in  ( 3.  5)  and  (3.  6)  respectively.  Assume  that 


(3.12) 


| h(u,  J)|  < Cl(u(l-u))P  (1  + J 2)  . 


In  addition  to  Assumption  (B)  assume  that  for  some  small  e > 0 , 


(3.13)  |L  (t)  - L (s)|<  c,(ta  - s“)  for  0 < s < t < t and  for  (l-e)<s<t.l 

where  7/8<a<l,  p>-y  and  (a  + (3)>l  . Then  under  the  alternatives 
(3.8), 


(3.14) 


£(Vv)  -*  N(b,  I)  , 


where 


b = f Cov(h(u,  T|),  r|)  f (u)du  • p/(1+p)« 
0 


Proof;  As  explained  in  Section  2,  the  vector  (S^,  . . . , S^)  given  D is 

jjc 

mult(n,  D ) where  the  m-vector  D has  the  components  given  in  (3.9). 
Using  conditional  expectations,  we  may  write 


(3.15) 


itcV  _ it<rV  i * 

E(e  v)  = E E(e  v|d  ) 

D* 

= E (J  (D*)  K (D*)) 
D V = V = 
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* 


where 


(3.16)  MD*)  = exp(ltm'  ? f^(n  D*)  - Eh(W/p)J) 

(recall  the  definition  (3.7)  of  and  the  relation  (3.5))  and 

V 

. m 

(3.17)  ly(|  ) ■ HW|(W*»(T  hk(Sk)  -(ilnD |J>|g  } . 


Now  from  Lemma 
E 

D 


3.  4,  It  follows  that 
(J  (D*))  - exp(lbt  - ct2/2) 

* V = 


with  b and  c defined  in  (3.  38)  and  (3.  39)  respectively.  Hence  by  the 

continuity  theorem  for  characteristic  functions 
_ i * 

(3.18)  f(m  ?[nv(nD  ) - Ep<W/p)])  - N(b,  c) 

* 

so  that  J (D  ) converges  in  distribution.  By  Lemma  3.  5,  with  probability 
v = 

* 

one  1.  e. , for  almost  every  random  vector  D , 


(3.19) 


-dt2/2 

e 


with  d as  defined  In  (3.44).  Combining  (3.18)  and  (3.19),  with  probability 

one,  the  product  J (D  ) K (D  ) converges  in  distribution.  But  since 
v = v = 

■ * * i 

I MD  ) K^(D  )|  < 1,  this  also  implies  the  convergence  of  the  moments 
so  that 


(3.  20) 


Kv(D  J)  - exp(ibt 


(c+d)t  /2  ) 
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f 


Using  the  continuity  theorem  for  characteristic  functions  and  Lemma  3.7, 
the  assertion  of  the  theorem  follows.  □ 

Lemma  3.  1.  If  the  conditions  (3.  12)  and  (3.  13)  of  Theorem  3.  1 hold,  then 


(3.21) 


, m » 

m"5  Y V h(k/(m+l),j)  [ir  (nD*)  - ir  (nD.)] 
k=l  j=0  J K J K 

m oo 

= m"  A £ h(k/(m+l),  j)  (J -nD.  ) ir  (nD  ) + o (1) 


k=l  j=0 


where  Ak  is  as  defined  in  (3.10). 


Proof;  Applying  the  Cauchy-Schwartz  inequality  on  the  difference  of  the 
two  sides  in  (3.21),  we  have 


m 


(3.  22) 


m 30  * I 

Yj  hk(j)  [TT^nDj^)  - irj(nDk)  {1  + (j-nDk)  Ak/m  ? }] 


m so 


< m“?  Y y I h ( j )l  * | exp  {j  log(l  + A , /m  2)  - nD  A./m? 
k=l  j=o  k k 


1 - (j  -nDk)Ak/m2Uj(nDk) 


l J?  * 2 


i I 

< m V [V  hJJ)  tt  (nD  )]?  [ V exp  {j  log(l+  A . /m?) 

k=l  j=0  3 K j=0 


-nDkAk/mt)  - 1 - (j-nDk)Ak/m^|21rj(nDk)]^ 


After  some  elementary  calculations,  we  see  that  the  term  in  the  second 
square  brackets  is 
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(3.2  3) 


eXP(DkAk/f)  ' 1 ‘ Dk  Ak/r 


Since  h(u,  x)  satisfies  condition  (A1),  using  Theorem  2.2  and  (3.2  3),  It 
is  clear  that  the  right  hand  side  of  (3.  22)  can  be  estimated  by 


.1  m 

m ? Y Cj-  g(k/m+l)(l  + (mDk)  Op(DkA^) 


(3.24) 


2 I 

= max  (A.  /m?)  • O (1)  . 
k k p 


2 - 

Now  we  show  that  this  max(A^/m2)  goes  to  zero  in  probability  when  a 
Observe  that  by  (3.  13) 


lik^l=lLm<Uk»-Lm(Ult-l,l/mi'Dk 


(3.25) 


< («■*-  ■ D„ 


< D“/m=  . Dk 


since  (t^  - sa)  < (t-s)a  for  0 < s < t < 1 and  a < 1 


Also  from  Darling 


(1953)  for  any  e > 0,  we  have 


2 +€ 


1 / min  (m  D ) = 0(1)  . 
l<k<  m 


Therefore  from  (3.  2 5) 


(3.  26) 


max  I A , /m 4 1 < O (m*2+*  ^_Q^~  4 ) 
1<  k < m 
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’•Vrtfci/  •>>  ».-*  -iti--  • * 


>7/8. 


Since  a > 7/8,  by  choosing  0 < t < A(l-a)  - 2,  max  | a k/m4  | -» 
probability.  This  proves  the  lemma. 


Lemma  3.  2.  If  the  conditions  of  Theorem  3. 1 are  satisfied  then, 
< > 0 . 


(3.27)  lim  sup 
m—  so 


i fm«  I » 

m V A V h(k/(m+l),  j)  (J-nD  ) it  (nD  ) 
k=l  J=0  K 1 


with  probability  one. 


Proof:  We  have 


X.  hk(J)(J-nDk)  trJ(nDJc) 


1 = 0 


■ <nDk,|,  V"  ’r)-i(nDk)  - (nDk'|0  hk°>  'i(nlV 


= (nDk)  fhk«+l)  - hk(J)]  irj(nDk) 
= nDk  g4(k/(m+l),  nDk),  say 


where 


(3.28)  94(u,x)  = X fh(u,  j+1)  - h(u,  j)]  ir  (x) 


1 = 0 


satisfies  condition  (A1)  . Hence  the  expression  in  (3.27) 
[m«  1 so 

7 

1 = 


(3.29) 


m 


i[nk]  30 

X X h(k/(m+l),  1)0  - nD  ) XT  (nD  ) 

k=l  1=0  J 


[mt  ] 

- 7 !Lm<UP  - 94(k/(mkl),  nDk, 

k=l 
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0 in 

□ 

for  any 
K**+0 


r 


Using  condition  (3.13),  and  writing  M = [mt  ],  this  is 


M 


(3.  30) 


l[fVuit>-  Vuic-i'l  V1'/""*1'-  "Dk»l 

M c 

- °1C3  £ (Uk*  • U'^Hk/mrfl*  (nDk)  2]  . 


We  now  make  use  of  the  representation  of  the  spacings  in  terms  of 

k 

i.  i.d.  exponential  r.v.  s ...  with  mean  1 . Writing  W^r  Vw/k, 

the  RHS  in  (3.  30)  is 


a+P  M 


= C 


' rV;c'  £ [vvJWm)®  - wJ^k-VMj^k/M^.  (wk2  + wm2) 

Wm  2 


(3.  31) 


M 


= C • € 


? 


C +1 


• «a+P  . W (a+C2>  • M_1  V w0”1  2 


-c2 
I A T *-  V 


Wm'“  ~2'  • M 2j  W“  4 (k/Mf  Tp‘*(W/  + Wu  • W/) 


k m 


• f{i  - (i-wk/k.wk)°}/(wk/kwk)] 

Now  by  the  strong  law  of  large  numbers  Wk  — 1 a.  s.  as  k — * and 


hence 
(3.  32) 


{1  - (l-Wk/kWk)0f}/(Wk/kWk)  - a as  k - *>  . 


Using  the  Holder  inequality, 

M . , c +1 

)wl *1(k/M)a+P“1V 

1 


1 1V1  1 1 C - 

M Jl?  (k/M)“+P  Wk2  [{l.(l.Wk/kWk)a}/(Wk/kWk)] 


!/p, 


1 VKi  « M 

,5-53)  fMf'i  'i  is  $<“/“> 

f T w 

lM  ) 


(or+p-l)p  1/p 

] 


iT.'V"',/,  fJ_  “ fl-<l-wk/kWfc)Q’1P4l'/P4 

* 1 ' 'm  f '^vnr 
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i'  rviftmrjriil 


1 as  n — oo,  the 


-1 

Using  the  fact  that  if  a,  — 1 as  k -*•  «,  n V a,  — 

k 1 k 

RHS  in  (3.  33)  converges  a.  s.  to  the  finite  limit 

1 (o+(3-l)p  1/p  (c  +l)p  1/p 

1 [/  u du]  [EW  & 3 a . 

0 

_c2 

Similarly  the  other  term  involving  W • W ) in  (3.  31)  can  be  handled  so 

k m 

that  we  get  the  desired  result.  □ 


Lemma  3.  3.  Under  the  conditions  of  Theorem  3. 1, 
j m oe 

m’  V A £ h(k/(m+l),  j)()  - nD  ) ir  (nD  ) 
k=l  K J=0  1 K 

1 

— f t (u)Cov(h(u,  r|),  ri)du  • (p/l+p)  in  probability. 

‘ 0 

Proof;  As  we  have  seen  in  (3.29), 

j m » 

m E Ak  ! h(k/(m+l),  J)  (J-nD  ) tt  (n  D ) 
k=l  J=0  k ) K 

(3.  34) 

_1  m 

= m V Ak(nDk)  g4(k/(m+l),  nDk) 
k — 1 

where  g (u,  x)  is  as  defined  in  (3.28).  For  any  fixed  € > 0,  we  may 

[m«  ] [m(l-c )] 

consider  the  sum  in  (3.  34)  as  consisting  of  3 parts  viz.  ^ and 

m k=l  kt=[m*  ] 

Y . Lemma  3.  2 shows  that  the  first  sum  is  negligible.  A similar 

k=[m(l-« )] 

analysis  can  be  used  to  demonstrate  that  the  third  term  is  also  bounded 
a.  s.  by  K t . 

Now  it  remains  for  u',  to  show  that  in  probability 
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rr 


j» 

A k(nDk)  g4(k/(rrn-l),  nD  k>-J  *(u)  • E(g4(u,  W/p)W/p)du 
k=fm«  ] * 

1 -<  30 

= f l(u)E(J  h(u,  j)(j-W/p)  rr  (W/P))du 
€ J = 0 J 

1 -«  30 

= / #(u)  V h(u,  j)(j-(jflj/(l+p))  P(r1=J)du 

1 = 0 l-< 

= f t( u)  • Cov(h(u,  r)),  ri)du  p/(l+p) 

t 

The  proof  of  Lemma  3.  3 will  then  be  completed  since  we  can  choose  t 
arbitrarily  small. 

By  our  assumption  L'  (y)  = / (y)  exists  and  is  continuous  except 

m m 

possibly  for  a finite  number  of  points  on  (,,  1-0  . If  <m(y>  Is  con- 

tinuous,  then  boundedness  of  / (y)  along  with  the  fact  fxg  (u,x)] 

m 4 

satisfies  condition  ( A* ) allows  us  to  apply  Theorem  2.2  as  follows:  From 
the  Glivenko-Cantelli  theorem, 


— 0 with  probability  1 . 


Also  from  Theorem  2.  2, 

[m(l-t )] 

m"  V nDk-  g4(k/(m+l),  nDk  ) = 0 (1)  . 
k =[  m«  ] 


max  A , -l  (k/m+1) 
. k m 


m 

r 

| 

(?.  35) 


Hence  the  sum  in  (3.  35)  has  the  same  probability  limit  as 

[m(l-€)J 

(3.36)  m V lm(k/(m+l))- nDk- g4(k/(m+l),nDk) 

k=[mt  ] 

which  from  Theorem  2.  2 is  the  required  limit  given  in  (3.  35). 

Now  if  I (•)  has  a finite  set  of  discontinuity  points  inside  («  , l-€)  , 
m 

this  will  not  create  any  problems  since  the  function  is  bounded  in  this 
interval. 
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< 


i 


Suppose  now  that  /^(y)  Is  continuous  in  (0,1)  except  at  y = yQ  . 

By  our  assumptions  t (y)  has  finite  left  and  right  limits  at  this  point  and 

m 

the  point  does  not  depend  on  m . Take  6 > 0 so  that  0<yQ-6<yo  + 6<l. 
From  our  assumptions  and  the  Glivenko-Cantelli  theorem  it  follows  that  with 
probability  one  is  bounded  whenever  |k/m-yQ|<6  and  m is 

sufficiently  large.  From  this  it  is  easily  seen  by  analogous  arguments  that 
the  contribution  to  the  sum  (3.  35)  from  such  terms  in  the  neighborhood  of 
yQ  can  be  made  arbitrary  small  by  choosing  6 sufficiently  small.  It  is 
obvious  that  the  situation  of  a finite  set  of  discontinuities  of  the  first  kind 
can  be  handled  the  same  way,  if  the  discontinuity  set  does  not  depend  on 
m . This  completes  the  proof  of  Lemma  3.  3. 


Lemma  3. 4.  Let 


Jy(D*)  = exp(itm"  J[MnD*|  - Ep(W/p)]) 


be  as  defined  in  (3. 16).  Then  under  the  conditions  of  Theorem  3.  1, 


2 

(3.  37)  E(J  (D  ))  - exp(ibt  - ct  /2) 

\j  = 


where 

1 

(3.  38)  b = f Cov(h(u,  n),  r,)  /(u)du  p/(l-H>) 

0 

and 

1 1 2 

(3.  39)  c = f Var  g (u,W/p)du  - ( f Cov(V\{  g (u,W/p))du)  . 

0 0 
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Proof;  We  can  write 


(3.40)  JJD*)  = exp(itm‘2  ([^(nD*)  - p^(nD)]  + fpJnD)  - EpJW/p)])  . 


v = 


In  Lemmas  3.  1 to  3.  3,  we  already  established  that  the  first  part 


1- 


m 2fp  (nD  ) - u (nD)l  converges  in  probability  to  b . Thus  we  need  only 


V = 

show  that 


(3.41) 


_I  2 

E(exp(itm  2[p^(nD)  - Ep^(W>)]>-  exp(-ct  /2) 


From  the  Condition  (3.12)  on  h(u,  J),  it  follows  using  moments  of 
the  Poisson  distribution,  that 


(3.42) 


|g  (u,x)|<  Y,  c (u(l-u)f  (1+x  ) w (x) 

J=0  1 

c' 

< cj  (u(l-u))P  (1  + x 2)  . 


Thus  g^u,  x)  satisfies  condition  (A1)  . Hence  Corollary  2.1  of  Section  2 

holds  and  the  asymptotic  normality  of 

mv 

P (nD)  = V g (k/(m+l),  nD.  ) 
v ~ k=l  1 K 

is  assured  by  Theorem  2.  2.  Further  Var(g^(u,  w/p))  and  cov(w,  gj(u,  W/p)) 

as  functions  in  u,  satisfy  the  conditions  of  Lemma  2. 1,  so  that  as  v -*  *> 

1 m m m 

m"  Var(£  g^k/fm+l),  Wk/p)  - m Cov  ( ^ Wk,  £ g(k/(m+l),  wk/p)) 


1 1 

(3.43)  -/  Var(g  (u,  W/p))du  - ( f Cov(W,g  (u,  W/P))du)‘ 

0 0 1 


= c . 


□ 
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Lemma  ^ . Under  the  assumptions  of  Theorem  3.1,  with  probability  one 


i.  e.  , for  almost  every  D -vector, 


where 


m 


MD  ) = E(exp(itm'*[^  h^)  - p(nD  )])|  D*) 


- exp(-dt  /2)  in  probability 


1 2 1 2 
(3.44)  d=  f E[g  (u,W/p)  - g (u,W/p)]  du  - p(  f Eg  (u,W/p)du)  . 

0 2 1 0 3 
Proof:  The  lemma  will  be  proved  by  verifying  that  the  conditions  of 


Theorem  2.  1 hold  and  showing  that  d = - b~  . First  we  have  by  the 

Glivenko -Cantelli  theorem  that  with  probability  one 
M ^ , 


(3.4S) 


£ D = U*  + m ? L_(U'W)  — q = P 
k=l  k * 


k M ' “m'  “ M'  q 


where  M = [mq]  and  U’K  is  the  kth  order  statistic  from  U(0, 1)  . Clearly 
since  P^  = q — 1 as  q — 1-,  conditions  (2.  9)  and  part  of  (2.  11)  of  Theorem  2.  1 


hold. 


For  real  numbers  a and  b,  consider 


(3.46) 


hj(u, j)  = ah(u, ))  + bj 


It  is  easy  to  verify  that  if  h(u,  j)  satisfies  condition  (A),  then  so  does 


hj  (u,  j)  . Consider 
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(3.47) 


E (hjMm+l),  tfc)  - Ehjtk/fm+l),  |k)) 


where  are  independent  and  4.  is  P tnD*)  . From  the 

1 m ^k  0 k 

assumptions,  it  follows  that  for  some  positive  constants  cj,  c we 


V(^)  = m_1  E Var(h1(k/(m+l),ek)) 


(3.48) 


ivi  ^ 

< m~  c Y [(k/(m+l))(l -k/(m+l))]P  ((nDJ  2 + 1) 

k=l  K 

1 m ^ c 

< m'  cj  E [(k/(m4l))(l-k/(m+l))f  (nDJ  2 + c 


m * c4  Cc 

< c"  ( E (nD  ) */m)  + c. 

1 k=l  K 


by  the  Holder  inequality  and  Lemma  2.1.  From  the  assumption  (3.13), 


(3.49) 


< nDk  + KlDkm^K2Dk  m» 


< K3(mDk)  + K2(mDk)  m; 


ill 

rsing  the  representation  D,  = W / V W.  where  W,,W,f...  are  i.  i.d. 

k k 4'  k 1 2’ 


T(l,  1)  random  variables,  it  follows  by 

for  c > 0 , 


law  of  large  numbers  that 
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lim  m 
m -*oo 


-1  r-1  C 

y;  (»Dki 


is  finite  with  probability  one.  As  a > we  have, 
theorem 

-1  o * C6  -1 

m V (mD  ) < m K 

T 


m 


(K^  mDk  + K2(mDk)tt 


using  the  binomial 


-a 


C6+1 

1)  6 -*  K 


with  probability  one.  Thus  with  probability  one 


(3.  50)  lim  sup  Var(  < *>  . 

Now  we  will  verify  that 
(3.  51)  lim  inf  Var(^)  > 0 . 

By  assumption  (A),  it  follows  that  there  exists  an  interval  [a,  b]  C (0, 1) 
and  integers  Jj*J2  such  that  h^u,  * h^u,  J2)  for  a<u<b.  Again 
from  the  strong  law  of  large  numbers  and  our  assumptions,  it  is  easily  seen 
that  for  any  0 < C<  D < *>,  with  probability  one 

# {k;  a < k/(m+l)  < b,  C < nD*  < D } /m  - ^ > 0 . 

Therefore  for  n sufficiently  large, 

Var(£q)  1 Tj  Varfhjfk/tm+l),  4k))/m  > K2  > 0 
a(m+l)<k<b(m+l) 

with  probability  one.  Hence  (3.  51)  is  satisfied  with  probability  one.  In  a 
similar  fashion  it  follows  that 
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m 


lim  sup  E|h  (k/(m+l),  4 ) |2+< /m  < » . 
k=l  1 


Therefore  the  Liapunov  condition 


M 


£ E I hjfk/fm+l),  4k) \ZU  /(m  Var(4'  )) 


..»!+«/  2 


1 


M 


(3.52)  =m"</2  V E|h1(k/(m+l),4k)|2+€/(Var(^))1+</2- 


m 


0 as  m -*  oo 


is  satisfied  with  probability  one.  Thus 


X<^/<Var(^))*)-*  N(0, 1) 


with  probability  one.  By  the  next  Lemma  3.6,  we  have  that  in  probability 

Var(t,'  ) -*  a2  A + 2abB  p 1 + b2q  p S 
*q  q q ' 

where  A -►  A.,  B -*  B,  as  q -*  1-  . This  verifies  that  the  assumptions 

q r q l 

of  Theorem  2.1  are  satisfied  with  probability  one.  From  the  definition  (3.44) 


of  d as  well  as  the  expressions  (3.  56)  and  (3.  57)  for  A^  and  B^,  it 


follows 
(3.  54) 


d = Ai  - Bi 


which  proves  the  lemma. 

* 


□ 


Lemma  3.6.  Given  D , let  (4 ,4  ) be  independent  and  4,  be 

_ *k 

* 

Ft>(nDk  ) . Under  the  assumptions  of  Theorem  3. 1 
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• 0*M 


rWv’.  * 


-I’?  Z --2-1 

(3.  55)  m V Var(h  (k/(m+l),  ))  - a"  A + Zab  B p 2 + b qp 

■j  1 k q Q 

in  probability  where 

q 2 
(3.56)  A =f  E[g  (u,  W/p)  - g (u,W/p)]  du 

q •'o  2 1 

and 

I 

(3.  57)  Bq  = p 2 J E(g3(u,  W/p))du  . 

Proof;  Recall  from  (3.  30)  that  hj(u,  j)  = a h(u,  J)  + bj  . By  calculations 

similar  to  those  in  Lemma  3. 1,  it  follows  that,  for  instance 

M * 2 * 

rn  v ^ h (k/(m+l),  j)  [w  (nD.  ) - w (nD  )] -*  0 
k=l  1=0  1 1 

in  probability.  Using  Theorem  2.2,  we  get 

M q 

m'1  Y li  (k/(m+l),  j)  t.(nD,  ) - f (Eg  (u,W/p))du 
k=l  J k 0 2 

in  probability.  Therefore 

, M £ ^ 

m V Eh  (k/(m+l),  £,  ) - f (Eg  (u,W/q))du  . 

1 * 0 

The  other  terms  can  be  handled  analogously  which  proves  the  assertion.  □ 
Lemma  3.  7 . 

. 2 

(3.  58)  c + d = <r 

2 

where  c,  d,  a are  defined  in  (3.  39),  (3.44)  and  (3.6)  respectively. 

Proof;  Let  W be  a r(l,  1)  variable  and  let  ri*  be  a random  variable 
such  that  T) 1 given  W is  Bo(W/p)  . Then  the  unconditional  distribution 


of  r|  ‘ is  given  by 
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(3.  59) 


I 


P(n*  = J)  = / e_W-  (W/p)j  e'(W/p)/J! 


dW 


= p/(l+p)J+1,  j = 0,1,2,...  . 


Thus  n*  has  the  same  distribution  as  the  geometric  random  variable  r) 
defined  in  (1.  9).  Let  E V denote  the  expectation  and  variance  over 

w w 

W while  E |w  v |w  denote  the  conditional  expectation  and  variance 

over  r|  given  W.  Then  from  the  definitions  of  g , g , g 

12  3 

(3.60)  E^ h(u,  t|)  = EwETi|wh(u,T1)  = Ewgi(u,W/p) 

(3.61)  E^h^u.r,)  = EwEri|wh2(u,T1)  = Ewg2(u,w/P)  . 

And  after  some  elementary  calculations, 


(3.62) 


p(l+p)_1  Cov(h(u,  n),  n)  = Ew  ^ I v\Zh(u’ 

= Cov(g1(u,W/p),  W)  = Ewg3(u,w/p). 


Now  from  the  definitions  (3.  39)  and  (3.44)  of  c and  d and  from  identities 
(3.60),  (3.61)  and  (3.62),  we  get 


Mi* 
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r 


* X 

c+d  = J Var(g  (u,W/p))du  - ( J Cov(W,  g (u,  W/p))du)Z 
0 0 1 

r1  2 * 

+ J E[g2(u,W/p)  - g (u,W/p)]  du  - p(J  Eg  (u,W/P)du)‘ 


/ VE,|w(h(u^»»du  + / Vv,|w(h<u>  ,',)>du 

1 

- (1+p)  [f  Cov(h(u,  n),  ri)  p/(l+p)du]2 


r1  r1  22 

= J Var(h(u,  n))du  - [J  Cov(h(u,  n),  p)du]  p /(1+p) 

0 0 


= cr 


These  lemmas  3. 1 to  3.  7 complete  the  proof  of  Theorem  3.  1.  The 
following  lemma  gives  a simple  sufficient  condition  for  (3.13)  hold. 

Lemma  3.  8 . A sufficient  condition  for  (3.13)  to  hold  in  a neighborhood  of 
the  origin  is  that 

0 < L'  (u)  < c - u°  ' for  0 < u < t . 

— m — 

Proof.  We  have  for  0 < s < t < « 


0 < f (cua_1  - L'  (u))du 
~ J m 


or 


= c(t"-sV«  - (Lm(t)  - Lm(s)) 


Lm(t)  ' Lm(s)  - S<V<*  • 


Since  L (0)  = 0 and  L'  (u)  > 0,  the  assertion  follows, 
m m — 9 


m 
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□ 


Corollary  3. 1 . Under  the  null  hypothesis  (1.1),  the  asymptotic  distribu- 


tion of  defined  in  (3.11)  is  N(0,  1)  . 

This  result  is  a direct  consequence  of  Theorem  3. 1 and  is  obtained 
by  taking  i(u)  = 0,  0 < u < 1 in  (3.  14).  This  corollary  regarding  the  null 

distribution  of  can  also  be  reformulated  in  the  following  interesting 
form  using  Lemma  2. 1. 


Corollary  3.  1‘  . Let  r^,  r^,  • • • be  a sequence  of  i.  i.  d.  geometric 

random  variables  with  pdf  given  in  (1.  9).  Then  the  asymptotic  null  dis- 
m m mm 


tributionof  ^hk(Sk)  is  N(E(£  h^)),  Var(Y(  W - P ^ bk» 
p is  the  regression  coefficient  given  by 


where 


m 

r 

i 


m 


m 


P = Cov(E  hk(hk)»  E hk)Aar(^  nk) 


Remark;  See  also  Holst  (1976b)  for  a different  proof  of  this  result. 


4.  Asymptotic  distribution  theory  for  symmetric  statistics 

This  section  deals  with  the  class  of  statistics  symmetric  in 
{S^,  . . . , S } i.  e. , statistics  of  the  form 

. m- 

(4.1)  T =£h(S) 

k=l  K 

for  some  given  function  h(J)  . Clearly  this  class  of  symmetric  statistics 

is  also  covered  by  the  asymptotic  theory  discussed  in  the  last  section. 

Indeed  if  the  function  hk(J)  does  not  vary  with  k i.  e. , the  function 

h(u,  J)  of  the  last  section  is  a function  only  of  j and  is  independent  of  u , 

then  we  obtain  the  symmetry  in  the  numbers  {Sj,  . . . , S } . But  since 
1 

f l(u)du  = 0,  it  follows  from  Theorem  3.1  and  Corollary  3.1  that  the 
0 
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uwmmm 


asymptotic  distribution  of  T under  the  sequence  of  alternatives  ( 3.8)  coincides 
with  that  under  the  null  hypothesis.  Thus  symmetric  statistics  of  the  type  (4.1) 

_ i 

can  not  distinguish  alternatives  that  are  at  a 'distance'  of  n”  2 and  have 
power  zero  against  such  close  alternatives.  Therefore  in  order  to  make 
efficiency  comparisons,  we  have  to  consider  the  more  distant  alternatives 
with  6 = l in  (1. 10).  Let 

(4.2)  A^\  G*  (y)  = y + L*  (y)/m4,  0<y<l 

with  L*(u)  = m 4(G  (F‘V’.))  - u)  . 
m m 

For  this  symmetric  situation,  we  will  make  the  following  slightly 
stronger  assumptions; 

Sjc  jjf 

Assumption  (B  ) . Assume  Lm(u)  is  twice  differentiable  on  [0, 1]  and  there 

* 

is  a function  L (u),  0 < u < 1,  which  is  twice  continuously  differentiable 

and  such  that 

(4.  3)  L (0)  = L*  (1)  = 0,  sup  |L  (u)  - L*  o(u)|  = o(l) 

0<u<l  m 

where  f (u),  t (u)  are  the  first  and  second  derivatives  of  L (u)  . 

Notice  that  for  such  smooth  alternatives,  the  following  also  hold; 

(4.4)  sup  I L (u)  - L (u)|  = o(l),  sup  |l_(u)  - I (u)  | = o (1)  . 

0<U<I  0<  U < 1 

We  define  analogous  to  (3.  9)  and  (3.  10) 
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(4.  5) 


Dk  ' Dk(l+  Vmi>  wlth  *k  * CLm<uic>  - Lm<uk-1»/D» 


We  observe  that  under  the  above  regularity  conditions,  we  have 


(4.6) 


max  | a , I < sup  li*(u)l  < K < X 
1 <k  <m  0<u<l  m 


The  following  theorem  gives  the  asymptotic  distribution  of  the 
* 

symmetric  statistics  T under  the  alternatives  (4.  2).  The  proof  of  this 

v 

theorem  will  follow  Lemmas  4. 1,  4.  2 and  4.  3. 

Theorem  4.1.  Suppose  that  there  exist  constants  c^  and  c^  such  that 


(4.7) 


|h(J)  | < c^j  2 + 1)  for  all  j 


* * 

Let  L (u)  satisfy  Assumption  (B  ) and  let 
m 


m 


(4.  8) 


V = V (h(S.  ) - Eh(T,))/m  ? (r 
v k=l 


where 


(4.9) 


o-2  = Var(h(r|))  - [Cov(h(n),  n)]2/Var  (n) 


and  r|  is  the  geometric  random  variable  defined  in  (1.  9).  Then  under  the 
alternatives  (4.  2) 


(4.10) 


C(V  ) -*  N(A,  1)  as  v -*•  » 


where 
(4.  11) 


A = ( f l (u)d  u)  Cov(h(i^  r|(r) -1)  - 4r|/p)  p /2(l+p)  <r  . □ 

0 
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Lemma  4. 1.  Under  the  assumptions  of  Theorem  4. 1,  we  have 


, m » t 

2 Z Z h())ff|(nDJ[(1+  A /m"4)1  expf-nD,  A,  /m‘) 

_ 1 J ^ K K k 


m 


k=l  J=0 


<4*l2>  -1  ' 0 -nDk)  Ak/m*  - {)()-!)  - 2jnD  + (nD  )2}  A2/m^r] 


= °p(1)  « 


Proof:  Using  the  Cauchy-Schwarz  inequality,  we  find  that  the  difference 

in  (4.  12)  can  be  estimated  by 

-7  v C2  Li 

m 2 £ c1<l  + (nDk)  ) [B(nDk,  Ak/m«)]2 

k-1 

where 


B(x,  y)  = V tt  (x)  [(l+y)j  exp(-xy/r)  - 1 - (J  - x/r)y 

r=o  1 

- {J(J-1)  - 2Jx/r  + (x/r)2}  yZ/2]2 

= exp(xy2/r)  - 1 - xy2/r  - x2y4/2r2  . 

2 2 3 

Therefore  for  small  (xy  ),  B(x,  y)  = 0((xy  ) ).  As  max  A = 0(1)  and 
max(mDk/log  m)  = 0^(1),  (cf.  Darling  (1953,  p.  251))  we  can  estimate  (4.12) 
by 

c /a  ^ 

(4.13)  m V g(nD.) 

k=l 

for  some  function  g(.)  satisfying  the  conditions  of  Theorem  2.  2.  Therefore 
(4.12)  is  Op(m  ♦)  which  proves  the  lemma.  □ 


Lemma  4.  2.  Under  the  assumptions  of  Theorem  4. 1,  we  have 


(4.14) 


m 


-3/4 


m » 


x.  x.h«'  vnDk)()  - nDk>Ak=  oP«" 


k-1  )=0 


Proof;  From  the  assumed  continuity  of  I m(u),  11  follows  that  for  some 

U,  where  UJ  , < U,  < U'  that 
k k-1  — k — k 


m ao 


(4.15) 


m'3/4  V V h(J)  tt  .(nD  )(J  - nD  )A 
k=l  J=0  3 

- m‘V4  X VV  Dk> 

k=l 


where 

(4.16) 


g (x)  = V h(J)  tt  (x)  (1-x)  . 
1=0 


Since  f I (u)du  = 0,  we  can  choose  0.  such  that  (k-l)/m  < 0,  < k/m 
J „ m k k 

m 0 

and  y l*  (0.  ) = 0 Also  since  1 is  continuous,  we  can  write  for  some 
m'  k'  m ’ 


k=l 


such  that  mln(0k>  Uk)  < < max(0k,  U^) 


m‘3/4|,'>k'«3<nDk' 


.1/4  # 

(4.17)  = m Yj  W fg3(n°k)  ‘ Eg3(W)l 

k=l 


-3/4  £ *•  * 

X.'m<Uk»<Uk-  V«3<nDk' 


k=l 


Now  max  f (U.  ) = O (1)  and  from  the  boundedness  of  the  Kolmogorov  - 
m k p 


Smirnov  statistic 
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itoMtMMNttMMNM 


....... 


1 . ~ 

2 


max  m 2 U.  - 0,  I = O (1)  . 
, k k p 


Using  Theorem  2.2,  we  have 
(4.  18) 


m'iA  I,  1 m 'V'“k  ' V *3(nIV  = °p,n"":» 

k=l 


Theorem  2.  2 also  gives 
<4.  19) 


1/2  m * 

m Y l (6.  ) [g  JnD.  ) - Eg  (W/p)]  = 0(1)  . 
. m k i < 3 P 

k=l 


Combining  the  estimates  (4.  18)  and  (4. 19)  in  (4. 17)  yields  the  desired 
result.  □ 

Lemma  4.  3.  Under  the  assumptions  of  Theorem  4.1,  we  have 


m 


m , « 2 

v A.  v h(J)  w.(nD  ) (10-1)  -2JnD.  + (nD.  ) }/2 
ktl  k f*l  J k k k 

^ ^2  2 2 
- ( f I (u)du)  Cov(h(n),  nlh-1)  - 4n/p)  • p /2(1+p) 

0 


in  probability  as  v -*  » . 


Proof:  Using  the  assumed  smoothness  of  t (u),  0 < u < 1 and  the 

— — m — — 

Glivenko-Cantelli  theorem,  it  follows  that 


2 *2 

max  |a.  - I (k/m+l)|  — 0 with  Probability  1. 
, . , - k m 
1<  k < m 


And  from  Theorem  2.2,  we  have 
m » 


m 


-1 


Y,  IY  h(j)  Wj(nDk)  0(J-1) -2jmDk/r  + (mDk/r)  }/2 1 = Op(l)  . 
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Hence  it  follows  that  it  is  sufficient  to  prove  the  stochastic  convergence 


(4.21) 


-1  ™ *2 

m J'|/m(k/(m+l))g5(nDk) 
k=l 


where 


(4.22)  g (x)  = V h(J)ir  (x)  (KJ-1)  - 2Jx  + x^}/2 

J=  0 J 

to  the  limit  in  the  assertion.  Again  Theorem  2.2  gives  that 

-1  m *2  1 *2 
m"  Z 95(nDk) -*  (f  f (u)du)  Eg5(w/p) 

k=l  0 


in  probability.  An  elementary  calculation  shows  that 


Eg  (W/p)  = E V h(J)ir.(W/p)  (J(J-1)  - 2j(W/p)  + (W/p)2}/2 
J=0 

rCovfhft!),  r) ( ri  — 1)  - 4r]/p)  • p2/2(l+p)2  . 


Thus  the  lemma  is  proved. 


Proof  of  Theorem  4. 1.  Following  the  method  used  in  the  proof  of  Theorem 
3. 1,  it  suffices  to  show  that 


m"  2[p^(  nD  ) - p (nD)]  -*  A in  probability. 


We  have 
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m"  ? K<n  5 ) ■ HL1/(n  D)] 

i m * # 

= m'  2 V Z h(j)  [.  (nD.)  - n (nD  )] 

k=l  J=0  ‘ ] K 

.1  m * i,  I 

= m 2 V(  V h(J)  tt  (nD  )[(1  + A ,/m4)  exp(-nD  A ,/m4) 
k=l  j=0  J K 

1 2 2 i 

-l-j-nDk  Ak/m4  - {j(j-l)  - 2jnDk  + (nDk>}Ak/m2 

3/4  m °° 

+ m E E hW  ^(nDk)U-nD.)A 

k-1  J=0  ] K * K 

m oo 

+ m_1  y Z h<J)  f,<nDJ  (JO-1)  - 2JnD  + (nD.  )2  } A */2  . 
k=l  J=0  J k < < 

Therefore  the  proof  of  Theorem  4. 1 is  complete  from  the  preceding  Lemmas 
4.1,  4.2  and  4.3.  □ 

* 

Taking  J (u)  * 0,  0 < u < 1 in  Theorem  4. 1 or  putting  j ) = h(j)  v k 

in  Corollaries  3.1,  3.1'  , we  get  the  following  result  on  the  asymptotic  null 
distribution  for  the  symmetric  statistics. 

* 

Corollary  4. 1.  Let  V be  as  defined  in  (4.  8 ),  Then  under  the  null 
v 

hypothesis  (1. 1)  V has  asymptotically  a N(0, 1)  distribution  if  the 
function  h( * ) satisfies  condition  (4. 7). 

5.  Some  further  results  and  applications  - the  nonsymmetrlc  case 
We  have  the  sequence  of  alternatives  given  by  (3.  8)  i.  e. , 

(5. 1)  G*  (y)  = y + Lm(y)/m^,  0 < y < 1 
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4m 


where 


1 _1 

Lm(u)  = m?(Gm(F  (u))  - u)  — L(u)  as  m — » . 

We  now  consider  first  the  problem  of  finding  the  optimal  choice  of 

* 

the  function  h(u,  J)  for  given  alternatives  Gm(y)  . We  use  a technique 
analogous  to  that  in  Holst  (1972). 

Theorem  5. 1.  If  the  sequence  of  alternatives  is  such  that  the  assumptions 
of  Theorem  3. 1 are  fulfilled,  then  an  asymptotically  most  powerful  (AMP) 
test  of  the  hypothesis  against  the  simple  alternative  (5.  1)  is  to  reject 
when 

m 

(5.2)  Y I (k/m+1)  S,  >c 

k=l  * 

where  f(u)  is  the  derivative  of  L(u),  mentioned  in  (5.1).  The  asymptotic 

distribution  of  this  optimal  statistic  is  given  by 

^ m _ 

(5.  3)  C ( m " 2 Y I (k/(m+l))(S,  - 1/p))  - N(0,<r  ) 

k=l 

under  HQ  with 

(5.4)  <rZ  = ( / f 2(u)du)  (l+p)/p2 

0 

while  under  the  alternatives  (5. 1) 

^ ^ 1^2  2. 

(5.5)  r(m"?  7,  l(k/(m+l))(S.  -1/p))  - N(P_  ( f / (u)du  , «r  ) 

k=l  K 0 

Proof;  From  Theorem  3. 1,  it  follows  that  the  asymptotic  power  of  the  test 

m 

which  rejects  HQ  when  J*  h(k/(m+l),  S,  ) > c is  given  by 

k=l 
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AMI 


(5.6) 


1 1 

ph  = / Cov(h(u,  r))  f (u)du/[J*  (Var  h(u,  r|))du 
0 0 

-if  Cov(h(u,  rj) , ri)du}2/Var(ri)]  2 . 

* 0 

Using  the  same  argument  as  in  Lemma  3.  1 of  Holst  (1972),  we  have  that 
this  quantity  is  maximized  when 

(5.7)  h(uj)  = f(u)  • j . 

The  results  on  the  asymptotic  distributions  follow  directly  from  Theorem 
3.  1 and  Corollary  3.  1 for  the  above  special  case.  □ 

From  this  result,  it  follows  that  the  AMP  test  of  level  a is  ex- 
plicitly given  by : Reject  HQ  if 

m 1 — 

(5.8)  [v  / (k/(m+l)(S  - l/p)]/[m(  f I (u)du)  (1 +p)p‘  ] ? > *■ 

k = l 0 

where  X is  the  upper  o-percentile  of  the  N(0, 1)  distribution.  Also 
from  Theorem  5.  1,  we  find  that  the  asymptotic  power  of  this  test  in  terms 
of  the  standard  normal  cdf  is  given  by  the  expression 

(5.9)  + (f  i 2(u)du/(l+p))  ^)  . 

° 0 

Furthermore  it  is  easily  seen  from  Theorem  3.  1 that  the  Pitman 
Asymptotic  Relative  Efficiency  (ARE)  in  using  h(u,  j)  = d(u)  • j instead  of 
the  optimal  h(u,  j)  = f(u)  • J is 
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1 2 1 2 1 2 1 2 
(5.  10)  e = (/  d(u)  l(u)du)  / {f  d (u)du  - ( J d(u)du)  }{J  I (u)du} 

0 0 0 0 


5.  A Applications;  Translation  alternatives 

We  now  consider  some  applications  of  the  above  results  on  non- 

symmetric  tests.  First  we  shall  look  at  the  translation  alternatives.  Let 

X,,  ....  X , be  absolutely  continuous  i.  i . d.  random  variables  with  dis- 
1 m-1 

tribution  function  F(x)  . Let  Y.,...,Y  be  i.  i.  d.  with  d.  f.  G(x)  . We 

1 n 


wish  to  test 


HQ:  G(x)  = F(x) 


against  the  sequence  of  translation  alternatives 


(5.11) 


(11  - 
A'  G(x)  = G (x)  = F(x  - e/m2) 
m m 


Let  f(x)  = F'(x)  be  continuous.  Then  as  m-*», 


- -1  -1 

(5.12)  Lm(u)  = m'[Gm(F  (u))  - u]  - -0f(F  ‘(u))  = L(u),  say. 


And  if  f'(x)  exists  and  is  continuous  except  for  at  most  finitely  many 
x's  then,  at  the  continuity  points  of  f'(F  \u)),  we  have 


(5.13) 


I (u)  - f (u)  = -0  f (F“1(u))/f(F"1(u)) 
m 


We  now  apply  Theorem  5. 1 to  the  following  special  cases  to  obtain 


the  asymptotically  optimal  test  statistics  based  on  {S^}  . 
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Ex.  1 (Wilcoxon  test) 


Consider  the  logistic  distribution  function 


F(x)  = l/(l+e  X),  -so  < x < » . 

In  this  case,  we  get 

-ffF'^uH/ffF'^u))  = 1 - 2u 

and  the  regularity  assumptions  are  all  satisfied.  Therefore  the  optimal 
statistic  from  Theorem  5. 1 is  based  on 
m m-1 

(5.14)  Y (1  - 2k/(m+l))  S,  =2  V R /(m+1)  - (m-l)m/(m+l) 

k=l  K I K 

+ 2n/(m+l)  - n 

where  Rj R^  ^ are  the  ranks  of  the  X-values  in  the  combined  sample. 

m-1 

Observe  that  the  Wilcoxon  2-sample  test  is  also  based  on  Y R . The 

k=l  k 

optimality  of  the  Wilcoxon  statistic  for  the  logistic  distribution  is  well- 
known.  See  e.g.  Lehmann  (1959),  p.  238.  Theorem  5.1  gives  the  asymp- 
totic distribution  of  the  Wilcoxon  test  as  a special  case. 

Ex.  2 (A  van  der  Waerden  or  normal  score  type  test) 

For  the  normal  d.  f. 

-I  / 2 

F(x)  = 4>  (x)  = (2it)  2 J exp(-|  t )dt,  -so  < x < so 

-CO 

we  find 

-f'(F-1(u))/f(F"1(u))  = ® -1(u)  . 
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It  can  be  verified  that  the  required  regularity  conditions  of  Theorem 


3.  1 are  satisfied.  Hence  the  AMP  test  is  based  on  the  statistic 


(5.15) 


from  Theorem  5.1.  Using  the  facts 


we  have  under  the  null  hypothesis  that 

(5.16)  t(T/mV  N(0,  (1+p)/p2)  . 

From  Theorem  5.1,  the  asymptotic  power  for  a one-sided  test  of  level  a 

_ l 

is  3>(-  +■  0(l+p)  2),  the  same  as  that  of  the  Student's  t-test. 

To  find  the  Pitman  efficiency  of  the  Wilcoxon  test  with  respect  to  the 
test  based  on  (5.15)  (which  is  optimal)  in  this  case,  we  only  need  to 
calculate  (5. 10).  Since 
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i <4. 


Mb 


t ’ 


1 2 1 -1  z 

and  J (2u-l)“du  = 1/3,  J ($  (u))  du  = 1,  using  formula  (5.10)  the  ARE 

0 0 

of  Wilcoxon  test  versus  the  normal  scores  type  test 
(5.  17)  e = 3/tr  . 

The  test  statistic  (5.  15)  has  the  same  asymptotic  properties  as  the 
Fisher-Yates-Terry-Hoeffding  and  van  der  Waerden's  rank  tests. 


Ex.  3 (A  median  test) 

Consider  the  double  exponential  distribution  with  density 


f(x)  = |e 


- x 


- X < x < 30 


We  find  that  the  conditions  of  Theorem  3.  1 are  satisfied  and 


, 1 for  u < j 

f’(F~  (u))/f(F_  (u))  =< 

1-1  for  u > } . 


Hence  the  optimal  test  is  based  on  the  statistic 
(5.18) 


[m/2]  m [m/2] 

T,  sk  - T sk  - 2 Z sk  - " . 

lc= 1 k=fm/2]+l  k=l 


[m/2] 

where  ^ S,  is  the  number  of  Y' s below  the  median  of  the  X’s  . This 

tii  k 

is  different  from  the  usual  median  test.  We  easily  find  that  under  the  null 


hypothesis, 
(4. 19) 


[m/2]  i 2 

f Y,  - nl/m?)  - N(0,  (l+P)/p  ) • 


k=l 


The  ARE  under  normal  alternatives  is  obtained  from 
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yr  A 


-f  $>  (u)du  + f (u)dx  = 2 ft'  (u)du  = (2/ir)? 

'o  Jj 

which  gives 

(5.  20)  e = 2/tt  . 

Ex.  4 For  the  Cauchy  distribution,  we  find 

(5.21)  / (u)  = 2 tamr(u-|)/[l+ {tan  w(u- } ) }2]  . 

Notice  that  1(0)  = t (1)  = 0 and  small  weights  are  given  to  the  S^'s  in 
the  tails.  Heuristically  this  is  appropriate  since  the  Cauchy  distribution 
has  heavy  tails  and  hence  random  fluctuations  are  large. 

5.  B Applications;  Scale  alternatives 

Next  we  consider  absolutely  continuous  positive  random  variables 

under  scale  alternatives.  Let  X, X . be  i.  i.d.  F(x)  and  Y......Y 

1 ’ rn-1  1 ’ n 

be  i.  i.d.  G(y)  with  F(0)  = G(0)  = 0 . We  wish  to  test 

(5.22)  Hq:  G(x)  = F(x)  , 

against  the  scale  alternatives 

(5.2  3)  Hjm);  G(x)  = Gm(x)  = F(x(l+ e/m^))  . 

If  the  density  f(x)  = F'(x)  is  continuous,  then  as  m -»  » , 

1 _1  _i  _i 

(5.24)  L (u)  = m2(G  (F  (u))-u)  - L(u)  = -0f(F  (u))  • F (u)  . 

m rn 
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And  if  f'(x)  exists  and  is  continuous  except  for  finitely  many  points, 
then  analogous  to  (5.13), 

(5.25)  l (u)  -*  f (u)  = -0[1  + f'(F_1(u))  • F"1(u)/f(F’1(u))] 

m 

where  P exists.  Optimal  statistics  based  on  {Sk}  can  be  derived  just 
as  in  the  case  of  translation  alternatives. 

Ex.  5.  (Savage  or  exponential  score  test) 

For  the  exponential  distribution  F(x)  = (1  -e'X)  for  x > 0 we  find 


(5.26) 


l(u)  = - 0(1  + log(l  - u))  . 


The  assumptions  of  Theorem  3. 1 can  be  verified  and  hence  an  optimal 
statistic  is  given  from  Theorem  5.1,  by 

m 

(5.27)  T = V log(l- k/(m+l))  (S  -1/p)  • 

k=l 

1 2 

Since  J (1  + log(l-u))  du  = 1 we  get  that 
0 


f (T/m*)  - N(0,  (1+p)/p2  ) 


(5.  28) 


and  that  the  asymptotic  power  is 


it 


29>  $ (-\  + 0(l+p)'2  ) . 

The  ARE  of  Wilcoxon  statistic  relative  to  T in  (5.27)  above  is  3/4 

The  statistic  T is  an  approximation  to  the  Savage  statistic  (see  Lehmann 

(197  5)  p.  103).  The  UMP  test  for  the  above  situation  is  the  test  based  on 
m-1  n 

k^-'l  Xkk^l  Yk  WhlCh  haS  the  Same  asymptotlc  P°wer  (5*  29)  as  the  statistic 
T in  (5.  27). 

Ex.  6.  (Capon  and  Klotz  test) 

Consider  the  folded  normal  distribution  with  distribution  function 

^5*  3°)  F(x)  = 2<X>  (x)  - 1 for  x > 0 . 

The  assumptions  of  Theorem  3.1  can  be  verified  and  we  find  the  optimal  test 
statistic  using  the  function 


*(u)  = 1 - [#  _]((l+u)/2)]2 


From  Theorem  4. 1,  the  statistic 


(5.  31) 


m 


T = Z _1f(l  + k/(m+l))/2])2  S 
k=l  x 


has  under  the  null  hypothesis,  the  distribution  given  by 

(5,  32>  * (T/m*)  - N(0,  2(l+p)/p2) 

and  the  asymptotic  power 

-55- 


t&s, 


,,  ■ *•*  ''' 


i 


®[-*a  + e(2/(i+P))*]  . 

Again  this  asymptotic  power  is  the  same  as  that  of  the  conventional  F- 
test. 

6-  Some  further  results  and  applications  - the  symmetric  case 

In  this  section,  we  consider  the  sequence  of  alternatives  given  by 

(4.2)  and  symmetric  test  statistics  of  the  form 

m 

(6.1)  T = l h(S  ) . 

v k=l  K 

As  in  Section  5,  we  first  consider  a result  on  the  optimal  choice  of 
the  function  h(* ) . 

Theorem  fe.  1 . For  the  sequence  of  alternatives  given  by  (4.2),  satisfying 
the  conditions  of  Theorem  4. 1,  the  asymptotically  most  powerful  (AMP) 
test  is  of  the  form:  Reject  when 

m 

(6.2)  V S (S  -l)>c  . 

k=l  K K 

Proof;  From  Theorem  4.  1,  it  follows  that  the  asymptotic  power  of  a test  of 
the  form  (6.  1)  is  a maximum  when  the  quantity  A given  in  (4. 11)  is  maxi- 
mized. Observe  that 

(6.  3)  Cov(ri,  Ti(ri-l)  - 4r|/p)  = 0 

and 

(6.  4)  Varfhfr,))  - Cov2(h(r1),  r^/Varfn)  = Varfh^)  - pn) 

where  p is  the  usual  linear  regression  coefficient 

P = Cov(h(^),^)/Var(n)  . 
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(6.  5) 


Therefore  we  can  rewrite 

2p  Z(l+P)2  A//  f*2(u)du  = Cov(h(n)-Pri,  ^(ri-l) -4T1/p)/[Var(h(T1) -Pr,) ] 1 
0 

= Cor  (h(ri)  -p-r),  t| ( t)  — 1)  -4r\/ p)  ’ [Var(r|(r|-1) -4r|/ P)] 

< [Varf^-ll-Wp)]*  = 2p'2(l  + p) 

with  equality  in  (6.  6)  if  and  only  if 

h(r|)  - Pr|  = a[r|(r|-l)  -4r|/  p]  + b 

for  some  real  numbers  a and  b . Thus  A is  maximized  by  h(-r|)  = t|(t)  - 1) 
and 

1 


(6.7) 


max  A = J f (u)du/(l  + p) 
h 0 


□ 


Using  Theorem  4. 1,  we  further  have  that  under  HQ  , 


(6.8) 


m 2 1 ? 

JStfpj  Sk(Sk-l)  - 2m/ p ]/m2[2p"  (1+p)])  - N(0,1) 


and  that  the  asymptotic  power  for  a test  of  level  c is 


1 


®f-X.a  + (J  l*2(u)du)/(l+p)]  . 

Further,  from  the  above  proof  we  see  that  the  ARE  in  using  h(Sk) 
instead  of  S^( S^. -1)  is 


(6.9) 


e = Cor  (h(ri)  - Pq,  -n(ri-l) -4ri/ p)  . 


m 


m 


The  statistic  Y,  which  is  equivalent  to  Y Sk(Sk~l)  was  proposed 


k=l 


k=l 
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by  Dixon  (1940).  Blumenthal  (196 3)  and  Rao  (1969)  discuss  the  ARE  of 
this  test  while  Blum  and  Weiss  (1957)  consider  the  consistency  properties. 
Blum  and  Weiss  (1957)  also  show  that  the  Dixon  test  is  asymptotically 
LMP  against  "linear"  alternatives  with  density  {l  + c(y  - |)},  0 < y < 1 

( ] c | < 2)  but  we  have  shown  that  Dixon  test  is  indeed  AMP  against 
alternatives  of  the  form  (i.  2). 

For  a nonnegative  integer  r,  if  we  define 

j 1 for  x = r 

(6.10)  h(x)  = < 

1 0 otherwise  , 

then 

T*v = 1,  h,v 

k=l 

is  the  statistic  Qm(0»  the  proportion  of  values  among  {S^}  are 

equal  to  r . This  statistic  has  been  discussed  in  Blum  and  Weiss  (1957) 
from  the  point  of  consistency.  Our  results  establish  the  asymptotic  norm- 
ality of  0 (0  under  H as  well  as  under  the  sequence  of  alternatives  (4.2). 
m 0 

After  some  computations  we  find  from  Corollary  4.1  that  under  the  null 
hypothesis 

(6.11)  £(  m2  (Qm(r)  - p/(l+p)f+1])  - N(0,  <r2) 
where 

<r2  = {p/(l+P)r+1}  [1  - (p/(l+P)r+S  {1  + (r-l/p)2  (p2/(l+p)}]  . 


(6.12) 


The  Wald-Wolfowitz  run  test  (1940)  Is  related  to  Q (0)  . Let  U be  the 

m 

number  of  runs  of  X's  and  Y' s in  the  combined  sample.  The  hypothesis 

H.  is  rejected  when  U/m  is  too  small.  From  the  definition  of  Q (r)  , 

0 m 

it  follows  easily  that 

(6.13)  |(U/m)  - 2(n/m)  (1 -Qm(0))|  < 1/m  . 

Thus  the  asymptotic  distribution  of  U/m  is  the  same  as  that  of  2p(l-Q  (0)) 

m 

and  we  thus  have,  under  H , 

(6.14)  f.(  m*  [(U/m)  - 2/(l+p)])  -N(0,4p/(l+p)3)  . 

Therefore  the  ARE  of  the  run-statistic  against  the  Dixon's  statistic  is 
p/(l+p)  as  has  been  shown  in  Rao  (1969). 

7.  Further  remarks  and  discussion 

It  is  interesting  to  note  that  the  theory  developed  in  this  paper  gives 
tests  based  on  (Sk)  which  are  asymptotically  equivalent  to  the  correspond- 
ing rank  tests  in  all  the  known  situations  discussed  in  Section  6.  For  a 
unified  approach  to  the  theory  of  rank  tests  see  Chernoff  and  Savage  (1958) 
or  Hajek  and  Sidak  (1967).  We  conjecture  that  this  property  is  true  in  gen- 
eral i.  e. , given  any  rank  test,  one  can  construct  a test  of  the  form  (5.  2) 
which  has  asymptotically  the  same  null  distribution  and  power.  If  this  is 
the  case,  then  the  theory  presented  here  seems  to  lead  to  much  simnler  test 
statistics  as  compared  to  the  corresponding  optimal  rank  tests.  It  is  inter- 


(Sk)  while  this  is  not  the  case  with  the  ranks.  Further  relationships  be- 
tween these  two  groups  of  tests  is  under  investigation.  It  may  also  be  re- 
marked that  the  theory  presented  here  covers  the  asymptotic  theory  of  many 
tests  that  are  not  based  on  ranks  as  for  instance,  the  run  test  and  the  medium 
test. 


The  theorems  presented  here  can  also  be  applied  to  study  similar  test 
statistics  when  the  samples  are  censored.  For  instance,  suppose  that  the 
samples  are  censored  at  the  right  by  X^m  , the  [(m-l)q]th  order 
statistic  in  the  X-sample.  Under  the  same  assumptions  as  in  Theorem  5.1, 
we  obtain  in  the  same  way  that  optimal  test  statistic  is  given  by 


(7.1) 


[(m-l)q] 

T=  £ !(k/(m+l))(Sk-l/p) 


k=l 


Under  HQ  , 

i 9 9 

c(T/m  2)  - N(0,  f f l2(u)du  - (/  l(u)du))2]  (p+l)/p2) 

0 0 

and  the  asymptotic  power  is 

9 9 9 i 

(7.2)  $>(-\  + (/  1 2(u)du)/{[ J f 2(u)du  - ( J l(u)du)2]  (1+p)}  2)  . 

a 0 0 0 

On  the  otherhand  censoring  in  rank  theory  can  not  be  treated  as  simply  as 
this.  See  for  instance  Rao,  Savage  and  Sobel  (i960). 

Results  on  the  asymptotic  theory  of  general  statistics  based  on  {Sk} 
have  been  obtained,  under  the  null  hypothesis,  by  Holst  (1976b)  using  a 
different  approach.  However  the  approach  used  there  does  not  seem  to  yield 
the  asymptotic  theory  under  the  alternatives,  of  the  kind  derived  here. 
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Throughout  the  paper  we  formulated  the  alternatives  In  terms  of  the 
sample  size  m of  the  first  sample.  But  a symmetric  way  of  formulating 
the  alternatives  would  be  to  express  it  in  terms  of  N = (m+n)  . We  then 
have 

N*(G„(F'l(u)) -u)  - L(u)  (1  + 1/p)*  = L.(u),  say. 

N t 

1 

In  terms  of  LJ(u)  = fjfu)  = I (u)(l+l/p)2,  we  see  that  the  asymptotic  power 
of  this  alternative  can  be  expressed  as 

1 2 -1  - 
4>[-X  + ( f l (u)du/(l+p)(l+p  ))?]  . 

or  JQ  1 

If  fj(u)  does  not  derend  on  p,  then  p=l  maximizes  the  power  i.  e. , 
in  large  samples,  choosing  m = n or  equal  sample  sizes  is  optimal. 


REFERENCES 


[1]  Blum,  J.  R.  and  Weiss,  L.  (1957),  Consistency  of  certain  two- 
sample  tests.  Ann.  Math.  Statist.  , 242-246. 

[2]  Blumenthal,  S.  (196  3),  The  asymptotic  normality  of  two  test  statistics 
associated  with  the  two-sample  problem.  Ann.  Math.  Statist.  34, 
1513-1523. 

[3]  Chernoff,  H.  and  Savage,  I.  R.  (1958),  Asymptotic  normality  and 
efficiency  of  certain  nonparametric  statistics.  Ann.  Math.  Statist. 

22,  972-994. 

[4]  Courant,  R.  and  Hilbert,  D.  (1962),  Methods  of  Mathematical  Physics, 
Vol.  2.  Interscience,  New  York. 

[5]  Darling,  D.  A.  (1953),  On  a class  of  problems  related  to  the  random 
division  of  an  interval.  Ann.  Math.  Statist.  2j , 239-253. 

[6]  Dixon,  W.  J.  (1940),  A criterion  for  testing  the  hypothesis  that  two 
samples  are  from  the  same  population.  Ann.  Math.  Statist.  JJL, , 
199-204. 

v * 

[7]  Hajek,  J.  and  Sidak,  Z.  (1967),  Theory  of  Rank  Tests.  Academic  press, 
New  York. 

[8]  Holst,  L.  (1972),  Asymptotic  normality  and  efficiency  for  certain 
goodness  of  fit  tests.  Biometrika  59  . 137-145. 

[9]  Holst,  L.  (1976a),  On  multinomial  sums.  Math.  Research  Ctr. , 

Univ.  of  Wisconsin  Tech.  Report  #1629. 


-62- 


[10]  Holst,  L.  (1976b),  Some  limit  theorems  for  the  indistinguishable  ball 
problem  with  applications  in  nonparametrics.  Math.  Research  Ctr.  , 
Univ.  of  Wisconsin  Tech.  Report. 

[11]  Le  Cam,  L.  (1958),  Un  theoreme  sur  la  division  dpun  intervalle  par 
des  points  pris  au  hasard.  Publ.  Inst.  Statist.  Univ.  Paris  7,  7-16. 

[12]  Lehmann,  E.  L.  (1959),  Testing  statistical  hypotheses.  John  Wiley, 
New  York. 

[13]  Lehmann,  E.  L.  (197  5),  Nonparametrlc  statistical  methods  based  on 
ranks.  Holden-Day,  Inc. , San  Francisco. 

[14]  Mann,  H.  B.  and  Whitney,  D.  R.  (1947),  On  a test  of  whether  one 
of  two  random  variables  is  stochastically  larger  than  the  other. 

Ann.  Math.  Statist.  18,  50-60. 

[15]  Pyke,  R.  (1965),  Spacings.  Jour.  Roy.  Statist.  Soc.  . Ser,  B 22 . 
395-449. 

[16]  Rao,  C.  R.  (197  3),  Linear  statistical  inference  and  its  applications. 
John  Wiley,  New  York. 

[17]  Rao,  J.  S.  (1969),  Some  contributions  to  the  analysis  of  circular  data. 
Unpublished  Ph.  D.  thesis,  Indian  Statistical  Inst.,  Calcutta. 

[18]  Rao,  J.  S.  and  Sethuraman,  J.  (1969),  Pitman  efficiencies  of  tests 
based  on  spacings.  Tech.  Repori  No.  11.  Research  and  Training 
School,  Indian  Statistical  Inst. , Calcutta. 


-63- 


fl9]  Rao,  J.  S.  and  Sethuraman,  J.  (197  5),  Weak  convergence  of  empirical 
distribution  functions  of  random  variables  subject  to  perturbations 
and  scale  factors.  Ann.  Statist.  ^ , 299-313. 
f20]  Rao,  U.  V.  R. , Savage,  I.  R.  and  Sobel,  M.  (I960),  Contributions 
to  the  theory  of  rank  order  statistics;  The  two  sample  censored  case. 
Ann.  Math.  Statist.  JJ,  415-426. 

[21]  Wald,  A.  and  Wolfowitz,  J.  (1940),  On  a test  whether  two  samples 
are  from  the  same  population.  Ann.  Math.  Statist.  4,  147-162. 


-64  - 

/ 


I 


